A hidden challenge of link prediction: which pairs to check?

نویسندگان

چکیده

The traditional setup of link prediction in networks assumes that a test set node pairs, which is usually balanced, available over to predict the presence links. However, practice, there no set: ground truth not known, so number possible pairs quadratic nodes graph. Moreover, because graphs are sparse, most these will be Thus, methods, often rely on proximity-preserving embeddings or heuristic notions similarity, face vast search space, with many close proximity, but should linked. To mitigate this issue, we introduce LinkWaldo, framework for choosing from quadratic, massively skewed space concise candidate that, addition being also structurally resemble observed edges. This allows it ignore some high-proximity low-resemblance and identify high-resemblance, lower-proximity pairs. Our built model theoretically combines stochastic block models (SBMs) proximity models. structure SBM maps out where new links expected fall, identifies plausible within blocks, using locality sensitive hashing avoid expensive exhaustive search. LinkWaldo can use any representation learning definition generate method, allowing power current future methods realized practice. We evaluate 13 across multiple domains show average returns sets containing 7–33% more missing than both embedding-based baselines’ sets. code at https://github.com/GemsLab/LinkWaldo.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

hidden caries: a new challenge for dentists

purpose: hidden caries is a term used to describe occlusal dentine caries that is missed on a visual examination, but can be detected by bitewing radiographs. the aim of this study is the review of new studies about this lesion and presentation the ways for diagnosis, prevention and treatmen of this lesion. review of literature: investigations believe that the etiology of this lesion is the mor...

متن کامل

Hidden space reconstruction inspires link prediction in complex networks

Hao Liao, Mingyang Zhou, Zong-wen Wei, Rui Mao,Alexandre VidmerYi-Cheng Zhang 1 Guangdong Province Key Laboratory of Popular High Performance Computers, College of Computer Science and Software Engineering, Shenzhen University, Shenzhen 518060, P.R. China 2 Department of Physics, University of Fribourg, Chemin du Musée 3, CH-1700 Fribourg, Switzerland 3 Department of Modern Physics, University ...

متن کامل

Hidden link prediction based on node centrality and weak ties

Link prediction has been widely used to extract missing information, identify spurious interactions, evaluate network evolving mechanisms, and so on. In this context, similaritybased algorithms have become the mainstream. However, most of them take into account the contributions of each common neighbor equally to the connection likelihood of two nodes. This paper proposes a model for link predi...

متن کامل

Grand Challenge: Model Check Software

Model checking has been successfully employed for verification of industrial hardware systems. Recently, model checking techniques have also enjoyed limited success in verifying software systems, viz., device drivers. However, there are several hurdles which must be overcome before model checking can be used to handle industrial-scale software systems. This article reviews some of the prominent...

متن کامل

A Hidden Challenge to the Regulation of Data Surveillance

Profiling is a data surveillance technique which is little-understood and ill-documented, but increasingly used. It is a means of generating suspects or prospects from within a large population, and involves inferring a set of characteristics of a particular class of person from past experience, then searching data-holdings for individuals with a close fit to that set of characteristics. It is ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Knowledge and Information Systems

سال: 2022

ISSN: ['0219-3116', '0219-1377']

DOI: https://doi.org/10.1007/s10115-021-01632-x